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IMPROVED METHODS FOR IN VITRO DNA AMPLIFICATION 
AND GENOMIC CLONING AND MAPPING 



Background of the Invention 
5 The field of this invention is the jn ^dtra amplification of nucleic acids. In 

particular, this invention relates to novel methods for cloning and replicating nucleic acids 
in the course of sequencing and physical mapping. It also relates to efficient methods for 
physical mapping of highly complex and lengthy genomes. 

The ability to map and sequence genomic DNA has important applications in the 
10 biological and medical sciences. The localization of human genes responsible for rare 
diseases is an important starting point for gene isolation and cloning, and the identification 
of relevant gene products and mutations, leading ultimately to improved understanding of 
the molecular basis of disease. By identifying genes or regions of hm^an chromosomal 
DNA involved in hereditary forms of cancer, Alzheimer's disease, and other diseases, new 
15 methods of diagnosis and treatment may be developed. 

Mapping a genome refers to pinpointing the location of genes and other features of 
interest on particular chromosomes. Sequencing refers to determining die order of 
nucleotides on tiie chromosomes. The two main types of genome maps are genetic Unkage 
maps and physical maps. Genetic linkage maps are generally made by studying tiie 
20 frequency with which two different traits are inherited togetiier, or linked. Physical maps 
are derived mainly from chemical measurements made on tiie DNA molecules tiiat comprise 
tiie genome. Accordingly, physical maps can be of several different types and include 
restriction maps as well as lower resolution maps obtained by in sM hybridization. All 
these maps share tiie common goal of placing information about genes in a systematic linear 
25 order according to tiieir relative positions along a chromosome. 

Restriction maps are based on sites in DNA fliat are cut by special proteins caUed 
restriction enzymes. Each enzyme recognizes a specific short sequence of nucleotides 
termed a -recognition sequence" and cuts each strand of tiie DNA at some point, termed a 
"cleavage site" or "restriction site." tiiat is witiiin tiie recognition sequence or some distance 
away from it Since many different nucleotide sequences are recognized by one or anotiier 
restriction enzyme, and tiiose sequences are generaUy dispersed randomly tiirooghout a 
genome, a physical map may be constructed by determining tiie relative locations of 
different restriction sites precisely. 

The genome of the bacterium £. soli comprises about 4.7 milUon base pairs. The 
smallest human chromosome is ten times tiiat size, witii tiie complete (haploid) human 
genome comprising about 3 bilUon base pairs. Because of tiie large siie of most genomes, 
the restriction enzymes tiiat are of particular value in restriction mapping are tiiose having 
recognition sites tiiat occur relatively infrequenfly, or are otiierwise distantly spaced 
throughout tiie genome, such tiiat the genome may be cut into relative Urge DNA 
40 fragments, preferably 100,000 to 2 million or more base pairs long. The fragments of DNA 
tiiat are produced upon digestion of a DNA substrate witii a restriction enzyme are termed 
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"restriction fragments." In general, the fewer the number of genomic restriction fragments 
generated, the easier is the task of correctly ordering them in a physical map. 

The largest genome that has been mapped with restriction enzymes that cleave DNA 
infrequently is the single chromosome of E. qqH. Smith, gt gl.. Science 236:1448-1453 
5 (1987). The approach taken in that instance was to digest intact E. goli chromosomal DNA 
with the restriction enzyme Notl, which cuts within an eight nucleotide recognition 
sequence, thereby producing a range of DNA fragments from 15,000 base pairs to 1 million 
base pairs. Most of the information needed to order the DNA restriction fragments came 
from hybridization studies using labelled probes corresponding to E. coK genes that had 
10 previously been cloned and characterized. 

Thus, the first step toward constructing a physical map of a genome generally 
involves the isolation and cloning of discrete genomic DNA fragments. In theory, sensitive 
DNA-probe technologies make it possible to construct physical maps while cloning only a 
small fraction of the genome that is being mapped. In practice, however, such an approach 
15 is suitable only for the coarsest level of physical mapping. At higher resolutions, most 
physical mapping is likely to be carried out by analyzing a collection of overlapping DNA 
clones that cover the entire genome. Physical mapping would entail the ordermg of the 
individual DNA clones according to their positions in the original genome. The individual 
clones are especially useful because they provide an inexhaustible source of the DNA from 

20 each genomic region. Having available a coUection of genomic DNA clones is also a 
prerequisite to determining the nucleotide sequence of the genome, since the clones would 
provide the actual DNA fragments that would be purified and prepared for sequencing. 

The principal limitation on the construction of a high-resolution physical map of a 
genome from an ordered collection of DNA clones is the failure of present methods to 

25 provide the means for cloning very large DNA fragments. Using standard recombinant 
DNA techniques, for example, the size of tiie largest DNA fragment that may be clonally 
propagated in a bacterial host cell is about 50,000 base pairs or less. That result is typically 
achieved by splicing tfie DNA fragment into a cosmid ~ a modified bacteriophage lambda 
vector specifically designed to accommodate DNA inserts on the order of 40,000 to 50,000 

30 base pairs in length. 

The shortcomings of this cosmid cloning system in constructing a physical map of a 
genome are two-fold. First, very large numbers of cosmid clones must be constructed to 
generate a complete set of mutually overlapping DNA fragments for the entire genome. In 
the case of the haploid human genome, for example, it has been estimated that anywhere 

35 from 75,000 to 375,000 or more different cosmid clones would be required for a high- 
resolution physical map. Pines, 1987, "Mapping the Human (Howard Hughes Medical 
Institute, Bethesda, MD). Second, cosmid clones are known to frequently accumulate 
deletions which may result from selection for a shorter size for faster replication and/or 
metabolic imbalances caused by the increased dosage of a particular gene on the cosmid that 

40 affects the growth of the bacterial host cells. Therefore, cosmid clones, especially tiiose 
containing essential genes, are usually difficult to maintain. 



wo 90/10064 



PCr/US90/Q0942 



10 



20 



-3- 

Recently, a method has been described for cloning large fragments of exogenous 
DNA into yeast by means of artificial chromosome vectors. Burke, fit iL, 1987, Science 
226:806-812. The use of this "yeast artificial chromosome" (YAC) cloning system offers a 
significant advantage over the cosmid cloning system in that genomic DNA fragments that 
range in size up to several hundred thousand base pairs may be successfully propagated. 
There are, however, several limitations on the use of the YAC cloning system, particularly 
as it may be appUed to genome mapping. For example, both the number of cloned 
molecules per yeast cell and the number of yeast cells per colony are much lower in YAC 
cloning than in cosmid cloning in bacteria, thus limiting the quantities of cloned DNA 
available for analysis. Furthermore, as with any la mo. cloning method, the DNA 
fragments cloned in YAC vectors can undergo rearrangements or deletions in the host ceU 
that may make it difficult to obtain a set of clones representing the entire genome. 

It is one object of the present invention, therefore, to provide a method for 
efficientiy cloning large fragments of genomic DNA that avoids the problems inherent in 
1 5 the existing in vivQ cloning technology. It is a further object of the present invention to 
provide a method for amplifying a desired nucleic acid sequence, such as that represented 
in a cloned genomic DNA fragment, thereby producing sufficient quantities of die sequence 
for physical or chemical analysis. The ability to carry out this method jn ata makes it 
generally useful for amplifying nucleic acids from any source, without the ^blems of 
selectivity, rearrangements, and deletions that may result from prbpagatii^ an exogenous 
nucleic acid sequence in a host ceU. At the same time, tiie ability to efficientiy amplify 
nucleic acid sequences ranging in size up to several hundred 'fliousand base pairs or more 
makes the metiiod ideally suited for such appUcations as constructing a high-resolution 
physical map of a complex genome or isolating intact medically or biologically important 
2S genes or gene clusters. 

Figures la-lb (referred to hereafter as Figure 1) show 64 oligonudteotide linkers 
having cohesive end sequences complementary to each of tiie possible restrictTon fragment 
cohesive ends resulting from digestion of a genomic DNA substrate the restriction 
endonuclease Sfi I. 

30 The present invention is directed to improved metiiods for cloning genomic DNA and 

for mappmg genomic DNA restriction fragments, and for use in conjunction with or 
independent of tiiose methods, an improved metiiod for amplification of DNA Oat entails 
tiie syntiiesis of novel replicable DNAs comprising a bacteriophage phi29 repUdation origin 
and genomic DNA heterologous to plii29. 

35 The method for cloning genomic DNA comprises the steps of: 

(a) digesting genomic DNA witii a restriction enzyme tiiat generates multiple 
different cohesive-end sequences; 

(b) contacting tiie digested genomic DNA witii a linker, said linker comprising a 
reporter nucleotide sequence and a cohesive-end sequence complementary to 
a cohesive-end sequence generated by the restriction enzyme of siip_ (a), under 
conditions such tiiat a linker is joined to a restiiction fragment b£ genomic 
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DNA; and 
(c) amplifying the product of step (b). 

The method for mapping genomic DNA restriction fragments comprises the steps of: 

(a) digesting genomic DNA with a restriction enzyme that generates multiple 
5 different cohesive-end sequences; 

(b) amplifying the restriction fragment products of step (a); 

(c) determining the cohesive-end sequence of each of said restriction fragments; 
and in accordance therewith, 

(d) aligning the restriction fragments in a physical map. 

10 The method for amplification of a nucleic acid sequence comprises the steps of: 

(a) synthesizing a double-stranded nucleic acid, a bacteriophage phi29 repUcation 
origin and a nucleotide sequence which is heterologous to phi29; and 

(b) synthesizing DNA from said double-stranded nucleic acid under the control 
of said replication origin. 

The advantages obtained by utilizing the methods of the present Invention for cloning 
genomic DNA and mapping genomic DNA restriction fragments as compared to methods 
already known in the art is that the present invention allows the cloning and mapping of 
genomic DNA of virtually any length, without resort to the burdensome and error-prone 
steps of Inserting the genomic DNA Into specialized vectors and replicating it m m&, and 
20 without the need for prior knowledge of the nucleotide sequence or genetic organization of 
any part of the genomic DNA. 

The advantage obtained by tiie method for ampBfication of a heterologous DNA 
under the control of a bacteriophage phi29 replication origin is tiiat tiie in vitm syntiiesis 
of DNA products, in tiie presence of phi29 DNA polymerase and deoxyribonucleoside 
25 triphosphates, occurs continuously, using as a template DNA up to 100,000 base pairs or 
more In lengtii. The metiiod of tiie present invention tiieref ore avoids tiie multiple exacting 
primer hybridization reactions required by tiie polymerase chain reaction (PGR) metiiod of 
Mullis, St al., U.S. Pat. No. 4,683,195. tiiat is commonly used in tiie art. as well as tiie 
inherent limitations of tiie PGR metiiod as appUed to tiie amplification of lengtiiy DNA 
30 substrates. 

The term -genomic DNA" as used herein refers to any DNA comprising a sequence 
tiiat is normaUy present in tiie genome of a prokaryotic or eukaryotic cell or a virus. The 
genomic DNA of a eukaryotic ceU includes, for example, nuclear and extranuclear 
chromosomal DNA. such as tiiat present in mitochondria and chloroplasts. Also included 
witiiin tiie scope of the term "genomic DNA" is any cDNA prepared from a messenger RNA 
or from tfie RNA genome of a virus. Metiiods for tiie extraction and/or purification of 
genomic DNA have been described, for example, by Gross-Bellard, ej al-, 1978, Eur. J. 
Biochem. 2&32-38. Smitii, fiial- 1987. Metii. Enzymol. JLSl:461-489, and Moon, et si.. 
1987, Nuc. Adds Res. 15;61 1-630. Metiiods for the preparation of cDNA are weU known 
in tiie art See, for example. Maniatis. Mai-, Molecular Cloning: A Laboratory Manual (New 
York, Cold Spring Harbor Laboratory, 1982). 
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The term "heterologous" as used herein in reference to a nucleic acid sequence refers 
to a nucleic acid sequence not ordinarily replicated under the control of a bacteriophage 
phi29 repUcation origin. Typically the heterologous nucleic acid sequence will comprise a 
DNA sequence present in the nuclear or extranuclear genome of a prokaryotic or eukaryotic 
5 organism, or the genome of a virus other than bacteriophage phi29. The heterologous 
nucleic acid sequence may be prepared by any suitable method, such as recovery from a 
naturally occurring nucleic acid by use of a restriction enzyme, or ia yifia synthesis, 
including, for example, the synthesis of a complementary DNA (cDNA) from an RNA. 
The term "reporter sequence" as used herein refers to any nucleic acid sequence by 

10 which it is possible to identify, either directly or indirecfly, another nucleic acid molecule 
with which the reporter sequence is associated. The reporter sequence may be of defined 
sequence or defined function or both. For example, if the reporter sequtmce comprises a 
defined nucleotide sequence, the nucleic acid molecule with which it is associated may be 
identified by a hybridization method, wherein the reporter sequence hybridizes with a 

15 complementary nucleic acid that is immobilized on a solid support. The hybridization 
metiiod may be carried out by any suitable metiiod, including tiiose described by Choutelle. 
Stal., 1978, Gene 2;1 13-122, and European Pat App. 0 221 308 (Carrico). Alternatively, 
the reporter sequence may be detected by its function, witii or witiiout knowledge of its 
sequence. Examples include tiie gene for a detectable phenotype, or preferably a replication 

20 origin or a promoter. If tiie reporter sequence comprises a replication origin or a promoter, 
for example, tiie nucleic acid molecule witii which tiie reporter sequence is associated i^ 
readily identified on tiie basis of its repUcation or transcription under tiie control of tiie 
reporter sequence. } 

The term "cohesive-end" as used herein in reference to linkers and restriction 
25 fragments refers to a single-stiand extension at tiie end of a double-stianded nucleic acid 
molecule tfiat extends beyond the region of base-pairing between tiie complementary 
strands. Such single-strand extensions are termed cohesive ends because tiiey are capable 
of hybridizing witii one anotiier sequence tfirough base-pairing of complementary 
nucleotides, tiiereby faciUtating the intramolecular or intermolecular joining of nucleic 
30 acids. The nucleotide sequence of tiie single-strand extension is termed tiie "cohesive- 
end sequence." Restriction fragments having cohesive ends are preferably obtained by 
digestion of double-stianded DNA witii a suitable restriction enzyme, such as any of tiiose 
hereinafter disclosed. Linkers having cohesive ends may be obtained by tiie same metiiod, 
or may be produced syntiietically, such as by tiie annealing of single-stranded 
35 oligonucleotides. 

The term "oligonucleotide" as used herein in reference to linkers and primers is 
defined as a nucleic acid molecule comprised of two or more deoxyribonucleotides or 
ribonucleotides. A desired oUgonucleotide may be prepared by any suitable metiiod. such 
as purification from a naturally occurring nucleic acid or life nQvft syntiiesis. Several 
metiiods have been described in the Uterature. for example, for tiie syntiiesis of 
oligonucleotides of defined sequence using various techniques in organic chemistry. 
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Narang. stal- 1979, Meth. Enzymol. g8;90-I09; Caiuthers, al., 1985, Meth. Enzymol. 
IS^lil; Froehler.fital., 1986, Nuc, Acids Res. 145399-5407. Oligonucleotides prepared 
by any method may subsequently be joined together by Ugation or otherwise to form a 
single oligonucleotide of any required length and sequence. 
5 The term "linker" as used herein refers to an oligonucleotide, whether occurring 

naturally or produced synthetically, which comprises double-stranded DNA. 

The term "primer" as used herein refers to an oligonucleotide, whether occurring 
naturally or produced synthetically, which is substantially complementary or homologous 
to all or part of a nucleic acid sequence to be amplified. The primer must be sufHciendy 

10 long to hybridize with a template nucleic acid comprising the sequence to be amplified, and 
to prime the synthesis of an extension product in the presence of an agent for 
polymerization. Typically the primer wiU contain 15-30 or more nucleotides, although it 
may contain fewer nucleotides. It is not necessary, however, that the primer reflect die 
exact sequence of tite nucleic acid sequence to be ampUfied or its complement For 

1 5 example, non-complementary bases can be interspersed into tiie primer, or complementary 
bases deleted from tfie primer provided that the primer is capable of hybridizing with tiie 
nucleic acid sequence to be amplified or its complement, under tiie conditions chosen. 

The term "origin-primer" as used herein refers to an oligonucleotide whether 
occurring naturaUy or produced syntiietically which comprises a replication origin, as 

20 denned ftartiier herein, joined to the 5' end of a primer. Under suitable conditions and in 
ti»e presence of an agent for polymerization, tiie origin primer is capable of acting as a point 
of initiation of an origin-primer extension product tiiat includes tiie nucleic acid sequence 
to be amplified or its complementary sequence, and tiie repUcation origin of tiie origin- 
primer. 

25 The term "secondary primer" as used herein refers to an oligonucleotide whether 

occurring naturally or produced syntiieticaUy, which under suitable conditions and in tiie 
presence of an agent for polymerization, is capable of acting as a point of initiation for a 
secondary prim» extension product tiiat includes tiie nucleic acid sequence to be amplifled 
or its complementary sequence. 

30 The term "extension product" as used herein refers to a nucleic acid molecule, tiie 

syntiiesis of which is initiated at tiie 3'-0H terminus of a primer, using as a template for 
synthesis the nucleic acid molecule to which the primer is hybridized. 

The term "&gent for polymerization" as used herein is generally understood to refer 
to any enzyme tiiat catalyzes tiie syntiiesis of a nucleic acid molecule from 

35 deoxyribonucleotides or ribonucleotides, using an existing nucleic acid as a template. 
Examples of such enzymes include DNA polymerase, RNA polymerase, and reverse 
transcriptase. 

In one of its aspects, tiie present invention is a metiiod for cloning genomic DNA 
which has as its initial step digesting genomic DNA witii a restriction enzyme tiiat generates 
40 multiple different cohesive-end sequences. Such enzymes are well known in die art and 
include, for example, die restriction enzymes Bgl I, BstX I, Hga I, and Sfi I. 
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The predictable cleavage of double-stranded DNA by a restriction enzyme results 
from the enzyme^s recognition of a certain sequence of base pairs (recognition sequence) 
in the DNA substrate. For any individual restriction enzyme, this specificity is 
characteristic, and essentially invariant among DNAs. The recognition sequences for most 
5 restriction enzymes consist of a specific sequence of four to eight base pairs or more, and 
may additionally include one or more random base pairs. 

In addition to the recognition sequence, a restriction enzyme is also characterized by 
the sites at which it introduces breaks in the phosphodiester bonds of each strand of the 
DNA substrate (cleavage sites) to generate discrete restriction fragments. The cleavage site 
10 for a particular restriction enzyme may occur within the enzyme's recognition sequence or 
some distance away from it. In the case of a restriction enzyme that is capable of 
generating multiple different cohesive-end sequences, the sequence immediately 
surrounding the cleavage sites necessarily includes one or naore random base pairs. 

By way of example, the recognition sequence for the restriction enzyme Sfi I 
15 comprises a specific sequence of four base pairs separated by five random base pairs from 
a specific sequence of four base pairs as shown: 

5' . . . GGCCNNNNNGGCC ... 3' 
3' . . . CCGGNNNNNCCGG . . . y 

T 

wherem N may be any one of the four nucleotides A (adenine), T (thymine), G (guanine), 

or C (cytosine). Sfa introduces staggered cleavages within the recognition sequence at the 

sites indicated by the vertical arrows. Accordingly, each resulting Sfa restriction fragment 

terminates at one or both ends with a cohesive end having the structure 

25 5' ... GGCCNNNN 3* 

3'...CCGGN 5' 

wherein the cohesive-end sequence, 5'-NNN-3\ comprises any one of 64 possible 
trinucleotide sequences. 

In general, if the substrate DNA is a linear molecule, the restriction fragments 

30 resulting from digestion of the substrate DNA with a restriction enzyme that generates 
multiple different cohesive-end sequences will be of two types: restriction fragments that 
comprise one of the original ends of the linear substrate DNA and one restriction enzyme- 
generated cohesive end, and so-called "intemar restriction fragments that comprise two 
restriction enzyme-generated cohesive ends. If the substrate DNA is circular, all the 

35 restriction fragments will comprise two restriction enzyme-generated cohesive ends. 

As will be apparent from this description, both the number of different restriction 
fragments that are produced upon complete digestion of the substrate DNA with a 
restriction enzyme and the number of different cohesive-end sequences that are possible 
for those restriction fragments will depend upon the restriction enzyme that is used. 

40 The number of different restriction fragments that are produced upon complete 

digestion of the substrate DNA with a restriction enzyme in turn depends upon the number 
of occurrences of the restriction enzyme recognition sequence in th6 substrate DNA. The 
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number of occurrences oif a specific recognition sequence may be estimated from the length 

of the restriction enzyme recognition sequence and the size of the substrate DNA. Given 

that DNA is comprised of four different nucleotides, the statistical frequency of a specific 

recognition sequence of length n base pairs is given by the formula: 

frequency = 1 
4° 

For example, a recognition sequence of four base pairs is expected to occur with a 
frequency of 1/256, a recognition sequence of five base pairs is expected to occur with a 
frequency of 1/1024, a recognition sequence of six base pairs is expected to occur with a 
frequency of 1/4096, and so on. The expected number of occurrences of a specific 
recognition sequence in the substrate DNA is then obtained by multiplying the calculated 
recognition sequence frequency by the size of the substrate DNA in base pairs. 

The number of different cohesive-end sequences that are possible among the 
restriction fragments produced by a restriction enzyme capable of generating multiple 
15 different cohesive-end sequences is a function of ti»e number of allowed nucleotide 
variations witiiin tiie cohesive-end sequence. Most commonly, each of the random 
nucleotides witiiin tiie cohesive-end sequence may be any one of the four nucleotides 
adenine, tiiymine, guanine, or cytosine, in which case tiie expected number of different 
cohesive-end sequences is equal to 4», wherein n is tiie number of random nucleotides 
20 within the cohesive-end sequence. 

Any restriction enzyme capable of generating multiple different cohesive-end 
sequences may be used in tiie present invention, altiiough the choice of a restriction enzyme 
in a particular instance will preferably be made on tiie basis of flie size and complexity of 
the substrate DNA. 

25 In general, it is desirable tiiat the recognition sequence of die restriction enzyme 

chosen occur relatively infrequentiy in tiie substrate DNA, in order to reduce tiie number 
of DNA restriction fragments tiiat are produced and tiiat are subject to subsequent 
manipulation and/or analysis. In mapping studies of prokaryotic or eukaryotic chromosomal 
DNA, for example, a restriction enzyme having a recognition sequence of at least six base 
30 pairs wiU usually be required, preferably at least eight base pairs, in order tiiat tiie resulting 
restriction fragments have an average size witiiin tiie range of about 10,000 base pairs to 
100,000 base pairs or more. OptionaUy, tiie number of restriction fragment products may 
be reduced (and tiieir average size correspondingly increased) by adjusting tiie conditions 
under which tiie substrate DNA is contacted witii tiie restriction enzyme (e.g., reaction time 
35 or temperature), such tiiat tiie DNA substrate is only partially digested, witii one or more 
of the restriction enzyme cleavage sites remaining undeaved. 

At tiie same time, it is desirable tiiat tiie restriction enzyme be capable of generating 
restriction fragments of tiie substrate DNA having a sufficient number of different 
cohesive-end sequences. Generally, tiie greater the number of cohesive-end sequences 
generated by tiie restriction enzyme, tiie greater the likelihood of distinguishing tiie 
resulting restiiction fragments from one another on tiie basis of tiieir cohesive-end 
sequences, and in turn, tiie easier it wiU be to clone tiie individual DNA restriction 
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fragments and/or to order them in a physical map. For the digestion of prokaryotic or 
eukaryotic chromosomal DNA. for example, the restriction enzyme chosen wiU usually be 
one capable of generating at least 64 different cohesive-end sequences, which corresponds 
to a cohesive-end sequence comprising a minimum of three random nuclides. 

After digesting genomic DNA with a suitable restriction enzyme, the method for 
cloning genomic DNA has as its next step contacting the resulting DNA restriction 
fragment(s) witii an oligonucleotide linker comprising a reporter sequence and a cohesive- 
end sequence complementary to one of tiie multiple different cohesive-end sequences 
generated by the restriction enzyme. 

The oUgonucleotide linker and tiie DNA restriction fragment(s) are desirably 
incubated togetiier under conditions such tiiat tiie cohesive-end of the oUgonucleotide linker 
hybridizes specifically with tiie complementary cohesive-end of a restriction fragment. The 
linker and a DNA restriction fragment witii which it has hybridized are tiien covalenUy 
joined using any suitable metiiod. preferably by ligation witii DNA Ugase. The presence 
of tiie reporter sequence provides the means for identifying tiie resulting product from 
amongst a mixture of DNA restriction fragments, and preferably provides tiie means for 
isolating and/or selectively amplifying the resulting product. 

If digestion of tiie genomic DNA produces a mixture of different restriction 

fragments comprising different cohesive-end sequences, each differentrestrictionfragment 
of tiie mixture convenienfly may be cloned according to tiie present invention by contacting 
tiie mixture witii a combination of two or more linkers having cohesive-end sequences 
complementary to tiiose of tiie different restriction fragments. 

Because tiie cohesive-end sequences of tiie different restriction fragments of tiie 
mixture usually wiU not be known wiOi certainty, it is desirable under such circumstances 
25 fliat tiie combination of linkers comprise a Bbrary of linkers having cohesive-end sequences 
complementary to substantiaUy every possible cohesive-end sequence generated by tiie 
particular restriction enzyme used. In tiiis manner, it may be assured tiut substantially 
every restriction fragment cohesive-end is joined witii a linker. Dependmg on tiie number 
of different restriction fragments resulting from digestion of tiie genomic DNA. it may 
furtiier be desirable tiiat individual aliquots of tiie mixture be separately contacted witii 
each individual linker from a library of linkers, preferably two or more linkers comprising 
different cohesive-end sequences, in separate reaction vessels, as for example, tiie separate 
wells of a microtiter plate. In this manner, tiie number of different restriction fragments 
which become joined witii linkers in any one reaction may be reduced, in ti^rn reducing or 
eliminating tiie need in a subsequent step to separate each different restriction fragment, 
one from anoflier, in order to achieve tiie cloning of each individual genomic DNA 
restiriction fragment. 

The final step of tiie method for cloning genomic DNA is tiie amplification of a 
genomic DNA restriction fragment to which a tinker and reporter sequence has been joined 
40 by any suitable means, either k vjyfi, such as by repUcation in a host ceU, or is. xina. la 
vittft amplification of a restriction fragment to which a reporter sequence of defined 
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sequence is joined may be accomplished, for example, by the methods disclosed by Mullis, 
Stal-. SUBS, or MiUer, et sIm PCX Publ. No. WO89/06700. published 27 July 1989. usmg 
a primer comprising a sequence complementary to that of the reporter sequence. However, 
neither of these methods is particularly well-suited for the amplification of lengthy DNA 
5 fragments. For example, the method of Mullls, §J gl. is generally limited to the 
amplification of DNA fragments no larger than several thousand base pairs due to the low 
processivity and relatively high error rate of the DNA polymerase used in the amplification 
reaction, ffiguchi, et al„ 1988, Nuc. Acids Res. ifi:7351-7367; Saiki, §t 1988, Science 
22^487-491* Furthermore, the method of Mullis, £tM. requires multiple cycles of nucleic 
10 acid denaturation and primer hybridization, resulting in burdensome labor or automation 
costs. 

In a particularly preferred embodiment of the present invention, the reporter 
sequence used in the cloning of genomic DNA comprises a bacteriophage phi29 replication 
origin and amplification of the restriction fragment to which it is joined is accomplished 

IS in vitro using bacteriophage phi29 DNA polymerase. 

The term "replication origin" generally refers to a sequence in DNA at which DNA 
synthesis is initiated by an agent for polymerization. Gutierrez, £j il., 1988, Nuc. Acids 
Res., 1^:5895-5914, disclose that the minimal phi29 replication origins utilized by phi29 
DNA polymerase are located within the terminal 12 base pairs at each end of phi29 DNA. 

20 The so-called "left origin of replication" comprises the sequence 5*-AAAGTAAGCCCC- 
3' and the "right origin of replication" comprises the sequence 5*-AAAGTAGGGTAC- 
3% with replication proceeding in the 5' - 3' direction as shown. However, because 
nucleotide changes may be made at one or more positions within these sequences without 
abolishing their function, all such sequence variants are likewise included within the scope 

25 of the term "bacteriophage phi29 replication origin" as used herein. 

Blanco, filal., 1984, Gene 22:33-40, and Blanco and Salas, 1984, Proc. Nat. Acad. Sci 
Sl;5325-5329, together describe the purification of bacteriophage phi29 DNA polymerase 
from £. sqU cells transformed with a recombinant plasmid containing the plii29 DNA 
polymerase gene. Garcia, fit al-. 1983, Gene 21;65-76, and Prieto, fit Mm 1984, Proc. Nat. 

30 Acad. Sci il;1639-1643, together describe the purification of bacteriophage phi29 terminal 
protein from £. sqU cells transformed with a recombinant plasmid containing the phi29 
terminal protein gene. 

Blanco and Salas, 1985, Proc. Nat. Acad. Sci. 22:6404-6408, and Blanco, fit al., 1988, 
in "EMBO Workshop - Gene Organization and Expression in Bacteriophages," at p.63, 

35 describe the replication of bacteriophage phi29 DNA in the presence of purified phi29 
terminal protein and phi29 DNA polymerase. The repUcation of double-stranded 
bacteriophage phi29 DNA is initiated at the replication origins present at both ends of the 
DNA by a protein priming mechanism. In the presence of phi29 terminal protein, phi29 
DNA polymerase and the f ur deoxynucleoside triphosphates, a terminal protein-dAMP 

40 initiation complex is formed that can be elongated to full-length phi29 DNA by phi29 DNA 
polymerase by a mechanism of strand displacement. 



wo 90/10064 



PCr/tJS90/00942 



In another of its aspects, therefore, the present invention provides a method for 
amplifying heterologous DNA sequences on the order of 10,000 base pairs to 100,000 base 
pairs or more in length, as weU as heterologous DNA sequences less than 10,000 base pairs 
in length, which method utilizes a bacteriophage phi29 replication origin and highly 
5 processive bacteriophage phi29 DNA polymerase. In accordance wif n the present invention, 
it is now feasible to amplify any DNA sequence by incorporating at- one or both ends 
thereof a bacteriophage phi29 repUcation origin. Thus, the subject invention provides new 
DNA constructs that do not naturally occur and that can be replicated uider the control of 
a bacteriophage phi29 replication origin. 
10 The bacteriophage phi29 replication origin may be obtained from natural sources or 

may be produced syntheticaUy, and may be joined to the end of a heterologous DNA 
fragment in a ligation reaction with DNA ligase or otherwise introduced adjacent to a DNA 
sequence to be amplified by methods known in the art, such as site directed mutagenesis, 
or through the use of an origin-primer according to the methods generaUy described by 

15 MuUis, si al-. snora . 

Amplification of a heterologous DNA sequence under the control of a bacteriophage 
phi29 replication origin typically wiU be performed iQ viJmin a buffered aqueous solution, 
in the presence of bacteriophage phi29 terminal protein, bacteriophage phi29 DNA 
polymerase, and the four deoxynudeoside triphosphates, and under suitable condidons of 

20 temperature, salts concentration (e.g., Mg**. NH*), and pH such that a multiplicity of full 
length copies of the heterologous DNA are synthesized. 

Because repUcation from a bacteriophage phi29 replication origin proceeds 
unidirectionally, using as a template one strand of a double-stranded DNA, the 
amplification of a double-stranded heterologous DNA is accomplished by joining to both 

25 DNA ends a linker comprising a bacteriophage phi29 repUcation origin, such that both 
strands of the DNA template are repUcated. Accordingly, the attiplifipation of a genomic 
DNA restriction fragment is preferably accomplished by joining lo both ends of the 
restriction fragment linkers comprising a bacteriophage phi29 replication origin. A linker 
comprising a bacteriophage phi29 replication origin is conveniently referred to as a ''phi29 

30 origin linker." 

In one embodiment of the invention, the genomic DNA lestrictiwi fragment will 
comprise two restriction enzyme-generated cohesive ends and the phi29 6rigin linkers 
preferably win each have a cohesive-end sequence complementary to one of the ends of the 
genomic DNA restriction fragment. In another embodiment of the invefition the genomic 

35 DNA restriction fragment will comprise one of the original ends of the geuomic DNA 
substrate and one restriction enzyme-generated cohesive end. Typically the original end 
of the genomic DNA substrate will be a blunt end, in which case amplification of the 
genomic DNA restriction fragment preferably wiU be. accomplished by joining to the 
original end a phi29 origin Unker having a blunt end, and joining 1^ tiie lestiiction- 

40 enzyme generated cohesive end a phi29 origin linker having a complementary cohesive end. 
The term "blunt end" as used herein refera to an end of a double-stranded nucleic acid that 
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lacks any single-stranded overhang. If the original end of the genomic DNA substrate is 
not a blunt end, it is preferably converted to a blunt end. for example, by removal of any 
single-stranded overhang with SI nuclease, or another suitable single-stranded exonuclease, 
or by filling in the original ends according to known methods. See, for example, Maniatis. 
5 al., supra . 

It will be appreciated by those of ordinary skill in the art that the method of 
amplification disclosed herein, by which a heterologous DNA is replicated under the control 
of a bacteriophage phi29 replication origin, has applications in many different aspects of 
molecular biology, and also in the diagnostic arts, and is not limited to use in conjunction 
10 with the methods of cloning and mapping genomic DNA of the present invention* For 
example, by joining a bacteriophage phi29 replication origin to one end of a double- 
stranded DNA, a means is provided for synthesizing a multiplicity of single-stranded DNAs 
suitable for such uses as templates for DNA sequencing or as hybridization probes. 

If in an amplification step two or more different genomic DNA restriction fragments 
15 are BXOpUfitd in the same reaction, it will be necessary to separate the different restriction 
fragments from one another in order to produce individual genomic DNA clones. 
Separation of the different restriction fragments may be accomplished subsequent to 
amplification by any of the known methods for physically separating nucleic acids, such as 
chromatography, centrifugation, or electrophoresis. Lengthy DNA molecules, in the size 

20 range from about 10,000 base pairs to 100,000 base pairs or more are preferably separated 
by the technique of pulsed field gel electrophoresis. Carle and Olson, 1984. Nuc. Acids. 
Res. 12;5647-5664; Smith, fit al., 1986, Genetic Engineering 1:45-70 (Plenum Press, New 
York). The separated DNA molecules can then be isolated in any known manner and, if 
desired, the isolated DNA molecules again may be amplified to produce a multiplicity of 

25 copies of each individual genomic DNA clone. 

Alternatively, the different DNA restriction fragments may be labeled with moieties 
capable of producing, either directly or indirectly, different detectable signals, as for 
example by incorporating such moieties in the different linkers that are joined with the 
DNA restriction fragments, and physicaHy separating the DNA restriction fragments on the 

30 basis of those different signals. Thus, for example, the detectable moiety can be a 
fluorophore, and the DNA restriction fragments separated by flow cytometry. Gray, fital^ 
1987, Science 238:323-329. 

In yet a further aspect, the present invention provides efficient means for the 
physical mapping of genomic DNA restriction fragments having multiple different 

35 cohesive-end sequences. After digesting genomic DNA with a restriction enzyme capable 
of generating multiple different cohesive-end sequences, the relative positions of the 
resulting restriction fragments within the substrate genomic DNA are established by 
determining the sequence at one or both ends of each different restriction fragment, 
including at least the sequence of each cohesive-end. Provided the cohesive-end sequences 

40 differ between each different restriction fragment, alignment of the restriction fragments 
is conveniently accomplished by taking advantage of the fact that restriction fragments that 
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are adjacent within the genomic DNA substrate, and therefore share common cleavage sites, 
will have complementary cohesive-end sequences. 

The nucleotide sequence at the end of a genomic DNA restriction fragment may be 
determined directly by any of the known methods of nucleic acid sequencing, for example, 
the chemical degradation technique of Maxam and Gilbert, 1980. Meth. Enzymol. 151:499-' 
560 or the chain termination technique of Sanger, gl., 1977, Proc. Nat. Acad. Sci. 
24;5463-5467. or may be determined indirectly from the cohesive-end sequence of a 
linker(s) that is joined to the restriction fragment. 

The manipulation and analysis of accumulated sequence data, particularly the aligning 
of individual genomic DNA restriction fragments in a physical map. is conveniently 
accomplished using a computer method. For example, Staden. 1982. Nuc. Acids Res. 
lll;4731-4751, describes a computer method for handling DNA sequence information, and 
aligning DNA restriction fragment clones that are related to one another by overlap of their 
sequences. 

15 The following examples are offered by way of iUustration, and are not intended to 

Umit the invention in any manner. AU patent and Uterature references described herein are 
expressly incorporated. 

Examples 

Digestion of Adftnnvinis-2 DN^ ^fh .vfn 

20 Purified adenovirus-2 DNA (International Biotechnologies, Inc.. New Haven. CT) is 

digested with restriction endonuclease Sfil (New EngUmd BioLabs, Beveriy, MA) in a 
reaction consisting of lOmM Tris-HCl (pH 7.9), lOmM MgCl^, lOmM 2-mercJptoetiianol. 
50mM Naa. lOO/ig/ml bovine serum albumin. 20/ig/ml adenovinis-2 DNA. and 5 units 
5/1 1 in a total reaction volume of 100 /il. for 1 hour at 50'C. 

25 Removal of AdBnnvimg- 2 Terminal Pr^ ff 

Adenovirus-2 DNA contains a virus-encoded protein covalenfly bound to tiie 5* 
terminus of each strand of die linear DNA molecule, removal of which is necessary to 
produce ends capable of ligation with linkers. Stillman, fit M.. 1981, CeH 22;497-508- 
Tamanoi. £i si-. 1982. Proc. Nat. Acad. Sci. 22:2221-2225. To separate die 5* terminal 

30 protein from adenovirus-2 DNA, 20 ftg/ml adenovirus-2 DNA in 10mm Tris-HCl (pH 7.5) 
is mixed witii an equal volume of IM piperidine. incubated for 2 hours at 37'C. than 

lyophilized, dissolved in water and lyophilized again. 

Oligonacleoririft Sviit^^,^^. 

The smgle-stranded 12-mer oUgonucleotide 5* GGGGCTTACTTT 3' and each of 64 
35 different single-stranded 15-mer oligonucleotides having tiie general sequence 5' 
AAAGTAAGCCCCNNN 3\ wherein N is any of tiie four nucleotides adenine, tiiymine, 
guanine, and cytosine. are syntiiesized on a Biosearch Model 8600 DNA SynAesizer in' 
accordance with the metiiod of Froehler, sL ll-. SJiEia. T prepare linkers, each of tiie 
different 15-mer oligonucleotides is annealed witii tiie complemehtary 12-mer 
40 oUgonucleotide in hybridization buffer, consisting of lOmM Tris-HQ (pH 7 J), lOmM 
MgClj. 20mM Naa. at a temperature between 25« and 30*0. The nucleotide sequ^ces of 
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the 64 linker products, having the general structure 

5' AAAGTAAGCCCCNNN 3* 
3' TTTCATTCGGGG 5* 

are shown in Fig. 1. 

5 Cloning of the Adenovirus-2 Genome 

Adenovirus-2 terminal protein is removed from the ends adenovirus-'2 DNA as 

described above. Linkers comprising two bacteriophage phi29 left origins of replication in 

a tail-to-tail orientation, and having the sequence 

y AAAGTAAGCCCCGGGGCTTACTTT 3^ 
10 3* TTTCATTCGGGGCCCCGAATGAAA 5' 

are then joined to the resulting blunt ends of the adenovirus-2 DNA, using 100 pmoles of 

linker per microgram of adenovirus-2 DNA in ligation buffer consisting of SOmM Tris- 

HQ (pH 7.8), lOmM Mga2, 2mM dithiothreitol, ImM ATP, ImM spermidine, and 50 

;ig/ml bovine serum albumin. 2 units of T4 DNA ligase (New England BioLabs, Beverly, 

15 MA) are then added per microgram of adenovirus-2 in the reaction mixture, and the 
reaction mixture is then incubated at 15*C for one hour. 

Following ligation of the bacteriophage phi29 origin linkers to the original ends of 
the adenovirus-2 DNA, the adenovirus-2 DNA is digested with Sfi I as described above. 
The cloning of the resulting Sfi I restriction fragments is then conveniently carried out 

20 according to the methods of the invention in the individual wells of 96-well microtiter 
plates. 

To each separate well of the microtiter plates is added 100 pmoles each of two 

different linkers of Fig. 1, such that every possible combination of two different linkers of 

Fig. 1 is contained in one or another microtiter well. The number of different combinations 

25 of the 64 linkers of Fig. 1, and thus the number of microtiter wells that need be prepared 

for the cloning of Sfi I restriction fragments, is 2016. In general, the number of different 

combinations of N different linkers may be determined according to the formula: 

number of combinations « N ^ - N . 

2 

30 1 fig of Sfi I digested adenovirus-2 DNA is then added to each separate microtiter 

well in 25 fil of ligation buffer together with 2 units of T4 DNA ligase (New England 
BioLabs, Beverly, MA). The reaction mixture is incubated at IS^C for one hour to allow 
the ligation of linkers to the cohesive-ends of the adenovirus-2 DNA restriction fragments. 
The ligation reaction is stopped by heating for 15 minutes at 68*C. 

35 Following ligation of linkers to the adenovirus-2 DNA restriction fragments, 

amplification of the adenovirus-2 restriction fragments is accomplished by adding to the 
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reaction mixture in each microliter weU lOfiM each dATP, dTTP. dGTP, and dCTP, 20mM 
(NH JjSO^, 5% (vol/vol) glycerol, 300 ng of purified bacteriophage phi29 tUnal protein, 
and 20 ng of bacteriophage phi29 DNA polymerase. After incubation for 2 hours at 30'C.,* 
the DNA synthesis reaction is stopped by the addition of lOmM EDTA and heating for 10 
minutes at 68"C. 

FoUowing polyacrylamide gel electrophoresis of the DNA synthesis reaction mixtures, 
each of the amplified adenoYirus-2 DNA restriction fragments is visualized by staining th^ 
gel with ethidium bromide, and then purified from the gel by electroelution. Sequencing 
of the ends of each strand of the restriction fragments is carried out according to the 
method of Maxam and Gilbert, siuaa, using a«p.cordycepin-5'-triphosphate for y-end 
labeling. Tu. si ll., 1980, Gene Jfi:l 77-1 83. 
Physical Manning of tha Arienovims-?. n^nm^ 

Digestion of adenovinis-2 DNA with S/i I generates four restriction fragments, 
ranging in size from approximately 1000 base pairs to approximately 16.000 base pairs. The 
nucleotide sequence at the 3' ends of the each of the restriction fragments is as foUows: 



Sequence 





1 


5* 
3» 


6TA ! ! 


> • • • • 
» • • • • 


T6C 


3' 
5' 


20 


2 


5» 




» • • • • 


GAC 


3' 




3* 


GAC ! i 


► • • • • 




5' 




3 


5' 
3» 


CTG ! '. 


> . . AT6 


3» 
5* 






4 


5« 
3« 


ACG . '. 


. . CTG 


3* 

5' 





in a physical map is accomplished by 



Alignment of the restriction fragments 

analyzing the nucleotide sequence of each restriction fragment for compl^entory^'-end 
sequences. Accordingly, the four Sfi I restriction fragments of adenovirus-2 DNA are 
aligned as follows: 
5« 
3' 



GTA . . . 
(fragment 



TGC 3* 
5« 



1) 

5» 
3' 



ACG .... 

(fragment 4) 

5« 
3' 



CTG 3» 
5« 



6AC ... 

(fragment 2} 



GAC 



5« 
3» 



3» 
5' 



• . . . AT6 3« 
CTG .... 51 

(fragment 3) 

CloninB of an Internal Sfi I FragmP.^^ Adun vir^ . f-o n^fi 

This example iUustrates an embodiment of the invention wherein a genomic DNA 
restriction fragment having a predetermined cohesive-end sequence is cloned. 

Adenovirus-2 DNA is digested with Sfi I as described above. From the known 
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nucleotide sequence of the adenovirus-2 genome* Roberts, et al., 1986, in "Adenovirus 

DNA," W. Doerfler (ed.) (Martinus Nljhoff Publishing, Boston, MA), it may be predicted 

that one of the resulting restriction fragments^ extending from nucleotide 17305 to 

nucleotide 23046 of the adenovirus-2 genome, will have the following structure: 

5 5' CGGC . . . GCCGGAC 3' 

3* GACGCCG . . . CGGC 5\ 

As the first step in the in vitro cloning of this restriction fragment, 1 ^g of Sfi I 

digested adenovirus-2 DNA is mixed with 100 pmoles of oligonucleotide linker having the 

sequence 

10 y AAAGTAAGCCCCCTG 3' 

y TTTCATTCGGGG 5' 

and 100 pmoles of oligonucleotide linker having the sequence 

y AAAGTAAGCCCCGTC 3* 
3, TTTCATTCGGGG 5* 

15 in 25 Ml of ligation buffer. 2 units of T4 DNA ligase are then added to the mixture and the 
ligation reaction allowed to proceed for one hour at 15*C. 

Following ligation of the linkers to the complementary cohesive-ends of the desired 
adenovirus-2 DNA restriction fragment, amplification of the restriction fragment is 
accomplished by adding to the ligation reaction mixture 20/xM each dATP, dTTP, dGTP, 

20 and dCTP, 20mM (fm^)^0^, 5% (vol/vol) glycerol, 300 ng of purified bacteriophage phi29 
terminal protein, and 20 ng of bacteriophage phi29 DNA polymerase. After incubation for 
20 minutes at 30*C., the DNA synthesis reaction is stopped by the addition of lOmM EDTA 
and heating for 10 minutes at 68''C. 
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Claims 

1. A method for amplification of a heterologous nucleic acid sequence which 
comprises: 

(a) synthesizing a double-stranded nucleic acid which includes the 
5 heterologous sequence and a bacteriophage phi29 replication origin; and 

(b) synthesizing DNA from said double-stranded nucleic acid under the 
control of said replication origin. 

2. The method of claim 1 wherein the double-stranded nucleic acid of step (a) 
is obtained by ligation of a bacteriophage phi29 repUcation origin to a double-stranded 

10 nucleic acid. 

3. The method of claim 1 wherein the double-stranded nucleic acid of step (a) 
is obtained by steps comprising: 

(1) providing an oligonucleotide origin-primer comprising a bacteriophage 
phi29 replication origin joined to a primer, and an oligonucleotide secondary primer, 

15 wherein said primers are not complementary to one another, tod are selected such that the 
extension product synthesized from one primer, when separated from its complement, can 
serve as a template for synthesis of the extension product of the other primer; 

(2) contacting a nucleic acid under hybridizing conditions with the origin- 
primer such that an origin-primer extension product is synthesized; and 

20 (3) contacting a nucleic acid under hybridizbig conditions with the 

secondary primer such that a secondary primer extension product is synthesized. 

4. The method of claim 1 wherein replication originates at one end of said 
double-stranded nucleic acid and the DNA products of step (b) are single-stranded. 

5. The method of claim 1 wherein replication originates at both ends of said 
25 double-stranded nucleic acid and the DNA products of step (b) are double-stranded. 

6. A method for amplification of a nucleic acid sequence heterologous to 
bacteriophage phi29 which comprises: 

(a) providingadouble-strandednucleicacidwhichincludestheheterologous 

sequence; 

30 (b) providing a DNA linker which includes a bacteriophage phi29 replication 

origin; 

(c) joining said linker to said double-stranded nucleic acid; and 

(d) contacting the product of step (c) with bacteriophage phi29 DNA 
polymerase, bacteriophage phi29 terminal protein, and four different deoxynucleoside 

35 triphosphates under conditions such that DNA is synthesized. 

7. The method of claims 1 or 6 wherein the bacteriophage phi29 replication origin 
comprises the sequence 5' AAAGTAAGCCCC 3\ 

8. The method of claims 1 or 6 wherein the bacteriophage phi29 replication origin 
comprises the sequence 5* AAAGTAGGGTAC 3'. 

^0 9. The method of claims 1 or 6 wherein the heterologous nucleic acid sequence 

is eukaryotic. 
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10. The method of claims 1 or 6 wherein the heterologous nucleic acid sequence 
is human. ^ 

11. The method of claim 6 wherein the DNA linker of step (b) is prepared by in 
vitro synthesis. 

5 12. A replicable DNA comprising a bacteriophage phi29 replication origin at one 

or both ends of a heterologous DNA sequence. 

13. The product of claim 12 wherein the heterologous DNA sequence is eukaryotic. 

14. The product of claim 12 wherein the heterologous DNA sequence is human. 

15. An origin-primer comprising a bacteriophage phi29 replication origin. 
10 16. A method for cloning genomic DNA which comprises: 

(a) digesting genomic DNA with a restriction enzyme that generates multiple 
different cohesive-end sequences; 

(b) contacting the digested genomic DNA with a linker, said linker 
comprising a reporter nucleotide sequence and a cohesive-end sequence complementary to 

15 a cohesive-end sequence generated by the restriction cnryme of step (a), under conditions 
such that a linker is joined to a restriction fragment of genomic DNA; and 

(c) amplifying the product of step (b). 

17. The method of claim 16 wherein the digested genomic DNA is contacted with 
two or more different linkers in step (b). 
20 18. The method of claim 16 wherein the cohesive-end sequences generated by the 

restriction enzyme of step (a) are of the same length. 

19. The method of claim 16 wherein the restriction enzyme of step (a) is Sfi I. 

20. The method of claim 16 wherein the reporter sequence of step (b) comprises 
a bacteriophage phi29 replication origin, and step (c) is accomplished using bacteriophage 

25 phi29 DNA polymerase. 

21. The method of claims 16 or 17 wherein steps (b) and (c) are repeated at least 

once. 

22. The method of claims 16 or 17 wherein the product of step (b) is separated 
from the digested genomic DNA before or after step (c). 

30 23. The method of claim 22 wherein the product of step (b) is separated by 

electrophoresis. 

24. A method for mapping genomic DNA restriction fragments which comprises 
(a) digesting genomic DNA with a restriction enzyme that generates multiple 

different cohesive-end sequences; 
35 (b) amplifying the restriction fragment products of step (a); 

(c) determining the cohesive-end sequence of each of said restriction 
fragments, and in accordance therewith, 

(d) aligning th restriction fragments in a physical map. 

25. The method of claim 24 wherein step (b) is accomplished by the steps 
40 comprising: 

(1) providing a library of linkers, wherein: 
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each linker comprises a bacteriophage phi29 repUcation origin and a 
cohesive-end sequence, and 

the library comprises linkers having cohesive-end sequences 
complementary to substantiaUy every cohesive-end sequence generated by the restriction 
S enzyme of step (a); 

(2) contacting the restriction fragments of step (a) with linkers from said 
library under conditions such that each restriction fragment cohesive-end is joined wiUi a 
linker; and 

(3) contacting the products of step (2) with bacteriophage phi29 DNA 
10 polymerase, bacteriophage phi29 DNA synthesis priming protein, and four different 

nucleoside triphosphates under conditions such tiiat multiple copies of each restriction 
fragment are syntiiesized under the control of said bacteriophage phi29 repUcation origin. 

26. The method of claim 24 wherein the method furtiier comprises separating tiie 
resti-iction fragment products of step (a) from one another before or after step (b). 

27. The metiiod of claim 24 wherein the metiiod furtiier comprises separating the 
restriction fragment products of step (a) from one anotiier by gel electrophoresis before or 
after step (b). 

28. The metiiod of claim 25 wherein step (2) furtiier comprises contacting tiie 
restriction fragment products of step (a) wiUi several different combinations of linkers in 
separate reaction vessels, each combination of linkers comprising two or more linkers from 
said library. 

29. The metiiod of claims 24 or 25 wherein tiie genomic DNA of step (a) is 
eukaryotic. 

30. The metiiod of chums 24 or 25 wherein tiie genomic DNA of step (a) is human. 
25 ^ 31, The metiiod of chihns 24 or 25 wherein ti» restriction enzyme of step (a) is 5/1 
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FIGURE la 



5» AAAGTAAGCCCCAAA 3 
31 TTTCATTCG6GG 5 

5' AAA6TAAGCCCCAAG 3 
3» TTTCATTCGGGG 5 

5* AAAGTAAGCCCCATA 3 
31 TTTCATTCGGGG 5 

5* AAAGTAAGCCCCATG 3 
31 TTTCATTCGGGG 5 

5» AAAGTAAGCCCCAGA 3 
31 TTTCATTCGGGG 5' 

5< AAAGTAAGCCCCAGG 3< 
31 TTTCATTCGGGG 5' 

5» AAAGTAAGCCCCAGA 3' 
31 TTTCATTCGGGG 5' 

5* AAAGTAAGCCCCAGG 3' 
3« TTTCATTCGGGG 5' 

5* AAAGTAAGCCCCTAA 3" 
3 ■ TTTCATTCGGGG 5 • 

5» AAAGTAAGCCCCTAG 3« 
31 TTTCATTCGGGG 5* 

5» AAAGTAAGCCCCTTA 3» 
31 TTTCATTCGGGG 5» 

5* AAAGTAAGCCCCTTG 3» 
31 TTTCATTCGGGG 5* 

5* AAAGTAAGCCCCTGA 3« 
31 TTTCATTCGGGG 5» 

5* AAA GTAAGCCCCTGG 3' 
3" TTTCATTCGGGG 5* 

5« AAAGTAAGCCCCTGA 3» 
31 TTTCATTCGGGG 5» 

5» AAAGTAAGCCCCTGG 3» 
31 TTTCATTCGGGG 5» 



5» AAAGTAAGCCCCAAT 3 
31 TTTCATTCGGGG 5 

5* AAAGTAAGCCCCAAC 3 
3* TTTCATTCGGGG 5 

5' AAA GTAAGCCCCATT 3 
3« TTTCATTCGGGG 5 

5* AAAGTAAGCCCCATC 3 
3» TTTCATTCGGGG 5 

5» AAAGTAAGCCCCAGT 3 
31 TTTCATTCGGGG 5^ 

5' AAAGTAAGCCCCAGG 3 
31 TTTCATTCGGGG 5' 

5» AAAGTAAGCCCCAGT 3 
31 TTTCATTCGGGG 5« 

5» AAAGTAAGCCCCACC 3" 
31 TTTCATTCGGGG 5» 

5» AAA GTAAGCCCCTAT 3' 
3» TTTCATTCGGGG 5» 

5» AAAGTAAGCCCCTAG 3' 
3' TTTCATTCGGGG 5» 

5» AAAGTAAGCCCCTTT 3* 
3* TTTCATTCGGGG 5» 

5« AAAGTAAGCCCCTTG 3* 
3» TTTCATTCGGGG 5« 

5» AAAGTAAGCCCCTGT 3» 
31 TTTCATTCGGGG 5« 

5* AAA GTAAGCCCCTGG 3* 
31 TTTCATTCGGGG . 5» 

5» AAAGTAAGCCCCTGT 3« 
31 TTTCATTCGGGG 5» 

5" AAAGTAAGCCCCTCC 3' 
31 TTTCATTCGGGG 5» 
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5' AAAGTAAGCCCCGAA 3' 
31 TTTCATTCGGGG 5« 

5» AAAGTAAGCCCCGAG 3» 
3» TTTCATTCGGGG 5» 

5* AAAGTAAGCCCCGTA 3» 
31 TTTCATTCGGGG 5» 

5' AAAGTAAGCCCC6TG 3* 
31 TTTCATTCGGGG 5» 

5» AAAGTAAGCCCCGGA 3» 
3» TTTCATTCGGGG 5« 

5 ' AAAGTAAGCCCCGGG 3 ' 
31 TTTCATTCGGGG 5» 

5' AAAGTAAGCCCCGCA 3* 
31 TTTCATTCGGGG 5« 

5» AAAGTAAGCCCCGGG 3« 
31 TTTCATTCGGGG 5» 

5' AAA GTAAGCCCCGAA 3> 
31 TTTCATTCGGGG 5» 

5* AAAGTAAGCCCCGAG 3* 
31 TTTCATTCGGGG 5» 

5» AAAGTAAGCCCCGTA 3« 
31 TTTCATTCGGGG 5» 

5' AAAGTAAGCCCCCTG 3* 
31 TTTCATTCGGGG 5* 

5' AAAGTAAGCCCCGGA 3' 
31 TTTCATTCGGGG 5' 

5« AAAGTAAGCCCCGGG 3' 
31 TTTCATTCGGGG 5* 

5* AAAGTAAGCCCCGCA 3* 
3* TTTCATTCGGGG 5» 

5* AAAGTAAGCCCCCCG 3' 
3« TTTCATTCGGGG 5" . 



lb 

5' AAAGTAAGCCCCGAT 3» 
3t TTTCATTCGGGG 5» 

5 » AAAGTAAGCCCCGAG 3 » 
3 • TTTCATTCGGGG 5 • 

5» AAAGTAAGCCCCGTT 3» 
3 • TTTCATTCGGGG 5 » 

5» AAAGTAAGCCCCGTC 3» 
31 TTTCATTCGGGG 5» 

5» AAAGTAAGCCCCGGT 3* 
31 TTTCATTCGGGG 5» 

5' AAA GTAAGCCCCGGG 3» 
31 TTTCATTCGGGG 5« 

5* AAAGTAAGCCCCGGT 3* 
3' TTTCATTCGGGG 51 

5* AAAGTAAGCCCCGCC 3 *: 
31 TTTCATTCGGGG 5» 

5» AAA GTAAGCCCCCAT 3» 
31 TTTCATTCGGGG 5> 

5» AAA GTAAGCCCCCAC 3» 
31 TTTCATTCGGGG 5* 

5» AAA GTAAGCCCCCTT 3 » 
31 TTTCATTCGGGG 5> 

5' AAA GTAAGCCCCGTC 3' 
31 TTTCATTCGGGG 5« 

5» AAAGTAAGCCCCGGT 3» 
3* TTTCATTCGGGG 5» 

5» AAAGTAAGCCCCCGC 3« 
31 TTTCATTCGGGG 5" 

5» AAAGTAAGCCCCCCT 3' 
3' TTTCATTCGGGG 5« 

5* AAAGTAAGCCCCCGC 3« 
3» TTTCATTCGGGG 5' 
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